Improving Logical Structure Analysis of Visually Structured Documents with Textual Features

نویسندگان

چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Logical Structure Analysis and Generation for Structured Documents: A Syntactic Approach

This paper presents a syntactic method for sophisticated logical structure analysis that transforms document images with multiple pages and hierarchical structure into an electronic document based on SGML/XML. To produce a logical structure more accurately and quickly than previous works of which the basic units are text lines, the proposed parsing method takes text regions with hierarchical st...

متن کامل

Grammatical Approach for the Physical and Logical Structure of Documents Analysis; Application to Summary Documents

This paper deals with the use of grammatical formalism to recognize the physical and the logical structures of a composite document. We propose a new system for document recognition and analysis. The aim of our research is to create a document structuring system by using a two level grammar. A two level grammar constitute a high level formalism of expression and structuration in document analys...

متن کامل

Document Structure Analysis Based on Layout and Textual Features

Document image processing is a crucial process in the office automation and begins from the ’OCR’ phase with difficulty of the document ’analysis’ and ’understanding’. This paper presents a hybrid and comprehensive approach to document structure analysis. Hybrid in the sense, that it makes use of layout (geometrical) as well as textual features of a given document. These features are the base f...

متن کامل

Recognising Textual Entailment with Robust Logical Inference

We use logical inference techniques for recognising textual entailment, with theorem proving operating on deep semantic interpretations as the backbone of our system. However, the performance of theorem proving on its own turns out to be highly dependent on a wide range of background knowledge, which is not necessarily included in publically available knowledge sources. Therefore, we achieve ro...

متن کامل

Summarisation of the logical structure of XML documents

Summarisation is traditionally used to produce summaries of the textual contents of documents. In this paper, it is argued that summarisation methods can also be applied to the logical structure of XML documents. Structure summarisation selects the most important elements of the logical structure and ensures that the user’s attention is focused towards sections, subsections, etc. that are belie...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Computer Science and Information Systems (FedCSIS), 2019 Federated Conference on

سال: 2022

ISSN: ['2300-5963']

DOI: https://doi.org/10.15439/2022r26